Extending the Lexicon
Abstract
This paper is concerned with the acquisition of the lexicon. In particular, we propose a method that uses analogical reasoning to hypothesize new polysemous word senses. This method is one of a number of knowledge acquisition devices to be included in DIRC (Domain Independent Retargetable Consultant). DIRC is a kind of intelligent, natural language-capable consultant kit that can be retargeted at different domains. DIRC is essentially "empty-UC" (UNIX Consultant, Wilensky et al., 1988). DIRC is to include the language and reasoning mechanisms of UC, plus a large grammar and a general lexicon. The user must then add domain knowledge, user knowledge, and lexical knowledge for the area of interest.
Similar resources
Wordnet creation and extension made simple: A multilingual lexicon-based approach using wiki resources
In this paper, we propose a simple methodology for building or extending wordnets using easily extractible lexical knowledge from Wiktionary and Wikipedia. This method relies on a large multilingual translation/synonym graph in many languages as well as synset-aligned wordnets. It guesses frequent and polysemous literals that are difficult to find using other methods by looking at back-translat...
Extending the Lexicon by Exploiting Subregularities
ing Terminate-Conversation to ancestor concept Creating new metaphor: Mapping main source concept Killing to main target concept Terminate-Computer-Process
Extending NomLex-PT using AnCora-Nom
This work describes how we used AnCora-Nom, a Spanish nominalization lexicon, to extend NomLex-PT, a lexical resource for Portuguese, originally based on the English NomLex lexicon and fully integrated to OpenWordNet-PT, our freely available Portuguese WordNet. The complete Spanish lexicon, which contains 1,655 entries, was translated to Portuguese and then compared to our previous data. Furthe...
Extending a Lexicon Ontology for Intelligent Information Integration
One current research topic in the Semantic Web area is the semantic annotation of information sources. On-line lexical ontologies can be exploited as a-priori common knowledge to provide easily understandable, machine-readable metadata. Nevertheless, the absence of terms related to specific domains causes a loss of semantics. In this paper we present WNEditor, a tool that aims at guiding the annot...
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene
In this paper we present a tagger developed for inflectionally rich languages for which both a training corpus and a lexicon are available. We do not constrain the tagger by the lexicon entries, allowing both for lexicon incompleteness and noisiness. By using the lexicon indirectly through features we allow for known and unknown words to be tagged in the same manner. We test our tagger on Slove...
Automatically Extending the Lexicon for Parsing
This paper describes a method for automatically extending the lexicon of wide-coverage parsers. The method is an extension to the automatic detection of coverage problems of natural language parsers, based on large amounts of raw text (van Noord 2004). The goal is to extend grammar coverage, focusing in particular on the acquisition of lexical information for missing and incomplete lexicon entr...